1. Introduction

The condition of driving is totally different in the night-time. This is due to various factors like reduction in the visibility and it can be very difficult to see road users like pedestrians, cyclists, and motorcyclists. UK road casualty studies by The Royal Society for the Prevention of Accidents (ROSPA) have shown 40% of collisions occur during darkness. Also, it was shown in certain studies that there is a link between weather and road accident risk. A bad weather like heavy rain and high wind increases the probability of a road accident. A heavy rain reduces the visibility of drivers and heavy wind cause an imbalance to cyclists and motorcycle riders. Certain studies reveal that there is a significant correlation between weather condition and aggregate number of accidents. This correlation is impacted based on the road surface condition as well.

This report will be looking to identify the answer for the question, ‘to what extent do weather and light conditions affect crashes?’ using transport data science techniques. Road accidents data for UK is available and greatly encouraged to use it for academic researches. The aread of study for this report is all the accidents happened in various LSOAs of London in the year 2019.

1.1 Scope

The scope of this report is to analyse the impact of weather condition and night-time on accidents. For this, data of accidents happened in various LSOAs of London during the year 2019 were analysed. The number of casualties were looked up on for accidents during night-time and bad weather considering the road surface condition and light condition. The analysis starts with data cleaning followed by data understanding and data exploration. Section 3 will focus on Night time accidents and the casualties. Section 3 will focus on casualties during bad weather and night time. Section 4 will find out the 10 LSOAs where most number of accidents are happening at night and during bad weather. Section 5 will study what kind of road condition and light condition is causing more casualties. Further sections will try to propose policy recommendations for night time journeys and during bad weather. None of the analysis is without limitations. The limitation of our analysis will be discussed in the last section.

1.2 Area of Study

London is the capital and largest city of United Kingdom. As per the road traffic statistics, in 2019, 22.6 billion vehicle miles of traffic were travelled across the 9.2 thousand miles of roads in London. Evidently the number of accidents in London should be more than any other city. London is divided into 32 local authority districts called the London Boroughs. There is a geographical hierarchy in England and Wales called Lower Layer Super Output Areas (LSOA) to improve the reporting of small area statistics. In our data, each of the accident is marked with the LSOA where it happened. There are 4835 LSOAs in London and s will be used for the geographical plotting in this report. Out of all the accidents reported in UK, 21% of accidents were in London which will make it an important area of accident study in UK.

1.3. Datasets

The accident data used for the analysis is from a package called stats19. Through this package we can access to Great Britain’s official road traffic casualty database – Stats19. For our analysis, three data sets of 2019 accidents were used – Accidents , Casualties and Vehicles. To get the geometric zones of all the LSOAs of London, a package called Propensity to Cycle Tool (PCT) was used. In order to distinguish night-time accidents with day time accidents, sunset/sunrise data is also used which indicates sunrise and sunset time of each day of 2019.

2. Data Understanding and Pre-Processing

For any data analysis, understanding the data set is very important. The data might not be present in the format which is required for our analysis. So it needs to be processed to meet our requirement. The process which was followed to prepare the data is explained in below sections.

2.1 Understanding the Datasets

Data set was also downloaded as csv files. The column headers were clearly indicating the meaning of each columns. Also stats19 package has two metadata data frames, stats19_schema and stats19_variables where each of the columns are clearly explained. The data set is filled with certain codes for variable like police_force, light_conditions. The meaning of each of these codes are explained in the metadata variables which is of great help in understanding the data set.

2.2 Data Preparation

After understanding the data set, next step was to filter out the data of accidents in London which is of our interest. For this, data was filtered on the column police_force with values “Metropolitan Police”, “City of London”. The R function filter() was used for this purpose. It was found that there were 25338 accidents happened in London in the year 2019. Then this was joined with the casualty data using left_join() function. This data was then joined with sunrise & sunset data with date of accidents as the joining key. The sunrise & sunset data contains the time of sunrise and sunset of each day in 2019The purpose of this was to filter out the accidents which happened after sunset and before sunrise. These accidents will be classified as night accidents. The lubridate package was used to work with dates and time. All such data were extracted to a new data set named night_accidents.

The accidents data set has a column weather_conditions indicating the weather of the region when accident occurred. Different weather conditions which were classifed as unfavourable includes Rain, high winds, Snow, fog, or mist. These weather conditions were classified unfavourable because it could have impacted the visibility of the driver.

Data cleaning is also an important step in data preparation. There were 39 accidents where the time of accident happened was missing. Since time is an important data for our analysis, it is better to update it with an optimal value instead of deleting it. To fill this missing data, average time of 10 adjacent dates accidents of the same LSOAs were taken and updated over there. In this way it is made sure that the records are preserved and not all records updated with same value.Instead it will have different values based on adjacent records.

Not all columns were required for further analysis. Out of 48 variables altogether, 21 were chosen for further analysis. Also, there were multiple records with same accident indexes, possibly due to multiple casualties. The duplicates were removed.

After data cleaning and joining operations, this data was saved to local machine using write.csv() function so that this can be reused on accidental deletion or modification in the cleaned data set.

2.3 Exploratory Data Analysis

As part of exploratory data analysis, a summary of accidents happened at night and due to bad weather were found. Then these were plotted as histogram as below. 28% of total accidents happened in London were at night. Out of the total accidents in London, 14% of them could be classified to occur during a unfavourable weather for driving. These numbers indicate there could be a relationship between accidents during night driving and different weather conditions.

Number of accidents happened at night time and due to bad weather in 2019Number of accidents happened at night time and due to bad weather in 2019

Number of accidents happened at night time and due to bad weather in 2019

3. Find who is more affected with the accidents

At this part of the report, we will be analysing the casualties happened on accidents at night and during bad weather. In this way we will get to know who is mostly affected during these accidents. Various casualty types to be analysed are Car occupant, cyclist, Motorcylist, Pedestrians, Heavy Goods Vehicle(HGV). There were many casualties other than these six but those were not significant in numbers. So those were classified as Other.

3.1 Methodology

The geom_bar() function from the package ggplot2 was used to draw the bar plots of casualty types against number of deaths. These plots give indications on travellers of which vehicle were mostly affected in these accidents.

3.2 Results and Discussions

The number of casualties were more for car occupants during night accidents which was close to 3000. This was followed by motorcycle with 1519 casualties, then pedestrians with 1411 casualties and cyclists with 984 casualties. Pedestrians and cyclists are considered to be the vulnerable categories in a road journey . The data indicates this vulnerability is equally or even more at night. The decrease in visibility at night increases the risk for cyclists and pedestrians. The number of casulties in HGV is comparitively very less .

To identify whether age plays any factor for pedestrians to meet with accidents, a histogram was plotted with age of pedestrians against number of casualties. Maximum number of casualties were occuring between age group of 25-50. Also it can be observed more number of male pedestrians are meeting with accidents compare to female pedestrians at most age groups.

Night - Casualty Groups and age distribution of pedestriansNight - Casualty Groups and age distribution of pedestrians

Night - Casualty Groups and age distribution of pedestrians

For weather related accidents, the result almost follows the similar trend of night accidents where most casualties were for car occupants (1305) followed by pedestrians (860), motorcyclist (784), and cyclist (551). A bad weather like rain or snow makes the road wet and increases the risk of accidents for cyclists and motorcyclists. Similarly heavy rain can reduce the visibility of drivers which put pedestrians at higher risks. To investigate which age groups of pedestrians are more vulnerable to these conditions, a histogram was plotted with age of casualty versus number of casualties. Most of the casualties were for age group of 15-30. Younger people including teenagers are very vulnerable to these conditions as these are the people who mostly walk to schools, universities, and work purposes.

Weather related- Casualty Groups and age distribution of pedestriansWeather related- Casualty Groups and age distribution of pedestrians

Weather related- Casualty Groups and age distribution of pedestrians

4. Geospatial Distribution of Accidents in London

In this section we will be looking which were the most accident prone locations at night time and when there was unfavourable weather during 2019. We will be locating 10 LSOAs in London where the accidents rates were higher in these conditions.

4.1. Methodology

The data from stats19 has LSOAs where accidents were reported along with the latitude and longitude of the accident location. To do the spatial plotting of the LSOA, the geometric boundaries of them are required. Since this feature is not present in stats19 data, it was downloaded separately from a package called Propensity to Cycle Tool (PCT). This was then joined with the stats19 data set with joining key as LSOA of accident location.

Then using group_by() and summarise(count = n()) function, the number of accidents occurred in each of the LSOA in the year 2019 was calculated. This is then sorted in the descending order and the top 10 LSOAs were picked using top_n() function. These LSOAs were then plotted on top of layer of London using functions from tmap package. Similar strategy was also applied for weather data and plotted 10 LSOAs for this category as well.

4.2 Results and Discussions

Figure below shows the thematic London map with geo spatial distribution of 10 most accident prone LSOAs. The most number of accidents at night were reported in LSOA E01002444 which is the LSOA of City of London. London City is one of the busiest cities in the world where there will be travelling irrespective of day or night.

Total number of accidents at night was 53 which is against the total number of accidents in the whole year was 195. In these 195 accidents, there were 212 casualties out of which 67 happened at night. This is 31% of total casualties which indicates the number of night accidents is significantly higher at night. Out of the 11 serious casualties in this LSOA at night, 7 of them were pedestrians and 3 of them were cyclists indicating higher risk of serious casualty for these groups.

Westminster, Hackney and Hillingdon followed City of London in number of night accidents.

10 most accident occuring LSOAs at night in London

10 most accident occuring LSOAs at night in London

In the case of bad weather also, LSOA E01032739 for City of London has the highest accident rate which is 19. This is followed by 3 LSOAs under Westminster, E01033595,E01004733 and E01004736 with accident rates of 15, 13, 12 respectivel. Out of the 19 accidents, 18 casualties were reported for either two wheeler drivers or pedestrians. The roads might be wet which makes it slippery for motorcylclists, cyclists and pedestrians.

10 most accident occuring LSOAs due to bad weather in London

10 most accident occuring LSOAs due to bad weather in London

5. Investigation of factors affecting accidents

5.1 Accidents happening at night

We have investigated with 3 factors which could be a reason for accidents happening at night, Light Condition of roads, Road Surface condition and speed limit in that road. For this we have plotted bar plots of these factors against the number of accidents.

Lighting Condition – There are 687 cases where lighting condition is unknown. Since we cannot assume the lighting condition of a road by any mean, we will ignore it. Then only 2% of accidents are happening with a bad light condition. This indicates lighting condition could not be a big factor for the accidents happened at night as most of the roads in London might be well lit at night.

Road Surface Condition - All the road surface condition other than dry makes the road wet, we will consider all these conditions as wet surface. In that case, 2379 accidents out of 7134 happened when the road surface is wet. This 33% of total accidents at night. So, Road Surface Condition has a bigger impact on night accidents.

Speed Limit – Around 90 % of accidents happened on a road where speed limit is less than 30 miles per hour. Since it is night-time, the vehicles might be driving at a speed higher than the speed limit of the road. This can contribute to the accident rates at night.

Factors affecting number of night accidentsFactors affecting number of night accidentsFactors affecting number of night accidents

Factors affecting number of night accidents

5.2 Accidents due to bad weather

Like night accidents, 3 same factors were investigated to check whether any of those external factors can impact accidents during an unfavourable weather. Bar plots were plotted with these factors against number of accidents.

Lighting Condition – From the bar plot it is clear that lighting condition doesn’t have much impact on accidents during a bad weather.

Road Surface Condition- 92% of accidents during bad weather is associated with a wet road. So road surface condition with a wet road has a huge impact on accidents. Speed Limit – Around 90 % of accidents during bad weather happened on a road where speed limit is less than 30 miles per hour. As we have seen, since most of the accidents were happening on a wet road, if vehicles go beyond speed limit there are more risks for an accident.

Factors affecting number of accidents due to bad weatherFactors affecting number of accidents due to bad weatherFactors affecting number of accidents due to bad weather

Factors affecting number of accidents due to bad weather

6. Policy Recommendations

As per the above analysis, we have noticed how a bad weather or night-time is increasing the risk of road accidents. We have seen that these conditions pose risk of casualties for pedestrians, cyclists, and motorcyclists. There are three factors which we observed to improve the road safety for the general during these conditions.

Upgrading existing signal systems and cycle tracks: Most of the accidents happened at night and during bad weather involves casualties with pedestrians, cyclists, and motorcyclists. We have seen in section 4 that most number of accidents were happing in City of London and serious casualties were happening to pedestrians and cyclists. Pedestrians are commonly injured when crossing the road. If we can introduce modern technique like split cycle offset optimisation (SCOOT) in more signal points, it allows signals to adjust timings based on the number of pedestrians detected at a crossing waiting to cross. Also, it needs to be checked if there are footway clutters like street furniture which causes injuries to pedestrians. For cyclists, we need to check whether number of cycle tracks are proportionally getting increased based on the increase in number of cycle riders.

Dealing with wet roads: As we have seen wet roads play a significant increase in the risk of causing road accidents. London sees lots of rain around the year which makes the road surface wet most time of the year. Cyclists can take safety measures like improve the grip of tyres with a slick tread which keeps more rubber to contact with tarmac which gives better grip. Also, they need to ensure to helmets with cycling caps under helmet which will deflect the rain away from the eyes. Also, it can be checked to enforce a law to make helmets mandatory for cyclist at least inside cities like London.

Enforcement of traffic laws: Obeying the traffic rules is very important for a safe road journey. At night and during a bad weather, there are chances of people being careless about traffic rules. This poses serious risk of getting involved in accidents especially for pedestrian and cyclists. So, if we can particularly make sure traffic laws are being enforced during these times and at the places where more accidents are being reported. One suggestion is to automate traffic enforcement by implementing more cameras to detect traffic violation. Moreover, pedestrian also needs to make sure they follow traffic rules and use the facilities like signals efficiently to avoid any harm to themselves.

8.Conclusion

The condition of driving at night times and during a bad weather is totally different from other times. This is mainly due to the reduction in visibility due to darkness or a heavy rain. The distance a driver can see gets shortened at night. Also need of eyes to get adjusted with the lighting after driving through a well lit road or a building is also a reason for night time accidents. Also, we have seen how road surface conditions like wet roads and speed limit become an important factor of road accident risks.

Through geospatial plotting we have uncovered the list of LSOAs where more number of accidents are happening at night and during a bad weather. In the year 2019, most such accidents occurred in City of London. Also, we have found car occupants, pedestrians, cyclists, and motor cyclists are the most affect group in these conditions. This report also gives different policy recommendation to avoid risk of road accidents at these times.

References

https://www.rospa.com/rospaweb/docs/advice-services/road-safety/drivers/driving-at-night.pdf

https://core.ac.uk/download/pdf/232568852.pdf

https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/463100/weather-on-road-casualties.pdf

BRSI, 2014, Are there more accidents in the rain? Exploratory analysis of the influence of weather conditions on the number of road accidents in Belgium

https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/358190/rrcgb2013-03.pdf

http://content.tfl.gov.uk/cycle-safety-action-plan.pdf http://content.tfl.gov.uk/pedestrian-safety-action-plan.pdf https://itsleeds.github.io/rrsrr/